CDS

Accession Number TCMCG017C19538
gbkey CDS
Protein Id OMO85142.1
Location join(10293..10326,10898..11247,11916..12002,13920..14225,14452..14698,14772..14892,16244..16651,17231..18932)
GeneID InterPro:IPR001106
Organism Corchorus olitorius
locus_tag COLO4_21739
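
The Location field above can be checked against the protein length given in the Protein section (1084aa). The following is a minimal sketch, assuming the coordinates are 1-based and inclusive as in GenBank-style feature locations. The helper names `parse_join` and `spliced_length` are illustrative and not part of any database tooling.

```python
import re

# Location string copied from the record's "Location" field.
LOCATION = (
    "join(10293..10326,10898..11247,11916..12002,13920..14225,"
    "14452..14698,14772..14892,16244..16651,17231..18932)"
)

def parse_join(location):
    """Return (start, end) integer pairs from a join(...) location string."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]

def spliced_length(segments):
    """Sum segment lengths; assumes 1-based, inclusive coordinates."""
    return sum(end - start + 1 for start, end in segments)

segments = parse_join(LOCATION)
cds_length = spliced_length(segments)
codons, remainder = divmod(cds_length, 3)

# Eight exons totalling 3255 nt, i.e. 1085 codons: 1084 residues plus a stop codon.
print(len(segments), cds_length, codons, remainder)
```

Under these assumptions the spliced length is a whole number of codons, and the codon count minus the stop codon matches the listed protein length.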

Protein

Length 1084aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA215141, BioSample:SAMN03160584
db_source AWUE01017761.1
Definition Aromatic amino acid lyase [Corchorus olitorius]
Locus_tag COLO4_21739

EGGNOG-MAPPER Annotation

COG_category Q
Description Belongs to the PAL histidase family
KEGG_TC -
KEGG_Module M00039, M00137, M00350
KEGG_Reaction R00697
KEGG_rclass RC00361
BRITE ko00000, ko00001, ko00002, ko01000
KEGG_ko ko:K10775
EC 4.3.1.24
KEGG_Pathway ko00360, ko00940, ko01100, ko01110, map00360, map00940, map01100, map01110
GOs -

Sequence

CDS:  
ATGGTTTTTCCAAACAAGACTGATCGGACTTCAGCTAGTATGCGTATTGTTCAAGAGCCTGTTAGATTAATGCCTGATGTTGTTCATGCGTTCATTGAAGCAAAGGTTTCCTTAGACAGGATCCTTAAATTTCTTGAGGCACCAGAATTGGGAGACAGAAAGCTACTGGAATATTGTAATAATGACATAAAGTCTAAGCACTCAATTTTCATTCGGTCCGATGAAATTTCTTGGGACCTTAATCCTTCCTCCAGACCTACGTTAAGGAACATAGATTTGGTGGTTAAACCGGGCGAAAAGGTGGCTATTTGTGGAGAGGTTGGCTCAGGAAAATCAACTCTTTTAGCTGCTGTTCTTGGAGAAGTTCCAAAAGTCAATGGCATTGAATATGTCTTGAAAGCTTTGTCTGATAAAACAGTCCTGCTTGTGACACACCAAGTTGACTTCCTTCCAGTTTTTGATTCTATCTTGGTTAGATATCAGCCTAATGCTCCATTAGTCCTTAAAGGCATAAGCTGCACATTTGAAGCAGGAAGCAAAATTGGAATAGTTGGCAGGACTGGTAGTGGGAAGACAACTCTTATCAGTGTTTTATTCCGCCTGGTGGAGCCTGCAGATGGAGAAATCATCATTGATAACCTTAACATATGCAAAATAGGGCTTCATGATTTGAGATCACGTTTGGGGATCATCCCTCAAGATCCTACACTTTTTGGTGGTTCTGTCAGATACAATCTAGACCCCTTGGAACAACATAGTGACAATGAGATATGGGAAGTTGCAGTTGTCCAAGATGGATCAAACTGGAGTGTGGGGCAGCGCCAATTGTTCTGCCTTGGACGTGCATTGCTTAAGAGGAGCAGGATATTAGTCCTTGATGAAGCCACTGCATCAATCGACAATGCAACCGACTCTATCATCCAGAAAACTATAAGAACAGAATTTGAAGACTGCACTGTGATAACTGTGGCTCATAGAATACCAACAGTAATGGACTGCAACATGGTGCTTGGTATTAGTGACGGGAAATTGGTGGAGTATGATGAGCCAATGAAGCTGATGAATATGAAAGGATCGTTGTTTGGACAACTAGTCCAGGAATATTGGTCACGTTCTGCTAACAATGATATAAGTCCAGAAGATTGTGAAGTAAAAAGCAATATGGAGGTGTCACAGCGAAAGGAAAATAAGTCACTAGAAAATCCGTACCTCAATGATCCATTAAACTGGGGTTCCGCTGCAGAGTCATTGAAAGGTAGCCATTTGGATGAAGTGAAAAGAATGGTTAATGAATACCGAAAGGCGGTGGTGCAGCTGGGTGGCCAGACACTGACTATAGCGCAGGTGGCTGCAGTGGCGGCTGCTCGTGACAATGGAGGCCGAGTTATGGTTGAGCTAAGTGAGTCGGCCAGGGGCGGTGTTAAGGCAAGCAGTGATTGGGTTATGGATAGTCTAAATAAAGGAACAGATACCTATGGAATTACTACTGGTTTTGGTGCAACTTCACATAGGAGGACTATTCATGGAGCTGCCTTACAAAAGGAGCTCATCAGGTTCTTGAATGCTGGAATCTTTGGCAGTGGCAACACAGAATCTTGCCACACAATGCCACACACAACAACAAGAGCTGGTATGCTAGTTAGAGTCAACACACTCCTCCAGGGCTACTCAGGCATTAGGTTTGAGATCCTGGAAGCTATCATCAAGCTACTAAATCACAACATTACTCCATGCTTGCCTTTGCGTGGCTCCATTTCTGCCTCAGGTGATATAATCCCTTATGCATACATTTCTGGACTTCTCACCGGACGCCCAAATTCCAAGGCTGTTGGTCCCAAAGGAGAACTCCTTGATGCCAAGGAAGCTTTTGACCTTGCAGGCATTGATGGTGGATTTTTTGAGTTGCAGCCTAAAGAGGGTCTAGCCCTTGTTAATGGCACAGGAGTTGGTTCTGGCTTGGCTGCCATAGTTCTCTTTGAGGCTAACATTTTAGCAGTTCT
ATCACAAGTTTTGTCAGCAATGTTTGCTGAGGTTATGCATGGAAAGCCAGAGTATACAGATCACTTGATTCATAAACTGAAGCACCATCCAGGTCAGATTGAGGCTGCAGCTATAATGGAACATATCCTGGAAGGAAGTGCCTTTATTAAGGCAGCACAAAAATTACATGAAATTGATCCCTTGCAAAAGCCTAAACAAGATCGATACGCTCTTCGTAGTTCCCCACAATGGCTAGGCCCTCAAGTTGAAGTAATTCGATCATCCACAAAGTCCATCGAAAGGGAGATGAATTCCGTGAATGATAATCCATTGATTGATGTGTCAAGAAACAAGGCCTTGCATTGCGGGAATTTCCAAGGTACTCCAATTGGTGTGTCCATGGATAACACAAGATTAGCTATAGCCTCAATTGGGAAACTCATGTTTGCTCAACTATCTGAACTTGTTAATGATATTTACAACAATGGGTTGCCATCAAATCTGTCAGGTGGTGGAAGGCATCCGAGTTTAGATTACGGATTGAAGGGTGCTGAAATAGCCATGGCAGCATATTGCTCGGAGCTACAATATCTGGCAAATCCTGTTACCAATCATGTGCAAAGTACAGAGCAACACAACCAAGATGTGAACTCGTTGGGGCTAATCTCTGCCAGAAAAACAGCTGAAGCTGTTGACATTTTAAAGCTCATGTCATCAACATACATGGTTGCGCTTTGTCAGGCTATAGATTTGAGACACCTGGAAGACAACTTGAAAAATGCAGTAAAAAACACAGTGAGCCAAGTTGCCAAAAAAGTCCTAACCTGCGACAAGGATTTGGTTAAAGTGGTGGATGGTGAGCATGTTTTTTCATATGCTGATGATCCATGTAATGCAAATTATCCACTCATGCAAAAGCTGAGACAAGTCCTGTTACAACATGCCTTGACATTGACAAACGTTAATGCTCTTAAGAATATTGGTGGTTTCGAGGAAGAACTGAAGAGGGTGTTGCCAAGGGAGGTTGAGAGAGCGAGAAGTGATTTCGAGAGTGGGAATTCAACAATCCCAAACAAGATCAAGGAATGCAGGTCTTACCCCTTGTACAAGTTTGTGAGACAAGGGTTAGGAACCGAGTTTTTAACTGGAGACAAAGTGAGACCGCCCGGTGAGGAATGTGACAAGGTTTTTGTTGCAATCTGTGAGGGTAAGTTGATTGATCCACTGCTCCAATGTCTCCAAGACTGGAATGGTGCTCCCCTTCCCATCTGTTAA
Protein:  
MVFPNKTDRTSASMRIVQEPVRLMPDVVHAFIEAKVSLDRILKFLEAPELGDRKLLEYCNNDIKSKHSIFIRSDEISWDLNPSSRPTLRNIDLVVKPGEKVAICGEVGSGKSTLLAAVLGEVPKVNGIEYVLKALSDKTVLLVTHQVDFLPVFDSILVRYQPNAPLVLKGISCTFEAGSKIGIVGRTGSGKTTLISVLFRLVEPADGEIIIDNLNICKIGLHDLRSRLGIIPQDPTLFGGSVRYNLDPLEQHSDNEIWEVAVVQDGSNWSVGQRQLFCLGRALLKRSRILVLDEATASIDNATDSIIQKTIRTEFEDCTVITVAHRIPTVMDCNMVLGISDGKLVEYDEPMKLMNMKGSLFGQLVQEYWSRSANNDISPEDCEVKSNMEVSQRKENKSLENPYLNDPLNWGSAAESLKGSHLDEVKRMVNEYRKAVVQLGGQTLTIAQVAAVAAARDNGGRVMVELSESARGGVKASSDWVMDSLNKGTDTYGITTGFGATSHRRTIHGAALQKELIRFLNAGIFGSGNTESCHTMPHTTTRAGMLVRVNTLLQGYSGIRFEILEAIIKLLNHNITPCLPLRGSISASGDIIPYAYISGLLTGRPNSKAVGPKGELLDAKEAFDLAGIDGGFFELQPKEGLALVNGTGVGSGLAAIVLFEANILAVLSQVLSAMFAEVMHGKPEYTDHLIHKLKHHPGQIEAAAIMEHILEGSAFIKAAQKLHEIDPLQKPKQDRYALRSSPQWLGPQVEVIRSSTKSIEREMNSVNDNPLIDVSRNKALHCGNFQGTPIGVSMDNTRLAIASIGKLMFAQLSELVNDIYNNGLPSNLSGGGRHPSLDYGLKGAEIAMAAYCSELQYLANPVTNHVQSTEQHNQDVNSLGLISARKTAEAVDILKLMSSTYMVALCQAIDLRHLEDNLKNAVKNTVSQVAKKVLTCDKDLVKVVDGEHVFSYADDPCNANYPLMQKLRQVLLQHALTLTNVNALKNIGGFEEELKRVLPREVERARSDFESGNSTIPNKIKECRSYPLYKFVRQGLGTEFLTGDKVRPPGEECDKVFVAICEGKLIDPLLQCLQDWNGAPLPIC
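
As a quick consistency check between the two sequences, the first and last codons of the CDS can be translated and compared with the ends of the protein. This is a sketch that assumes the standard genetic code (NCBI translation table 1). The `translate` function and the short fragments are illustrative, with the fragments copied from the start and end of the CDS above.

```python
from itertools import product

# Standard genetic code in the conventional compact form (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate DNA in frame 1; unknown codons become 'X', '*' marks a stop."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )

cds_start = "ATGGTTTTTCCAAACAAGACT"  # first 21 nt of the CDS
cds_end = "CCCATCTGTTAA"             # last 12 nt of the CDS

print(translate(cds_start))  # MVFPNKT
print(translate(cds_end))    # PIC*
```

The translated fragments agree with the protein's first residues (MVFPNKT) and last residues (PIC), with the final codon TAA acting as the stop.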